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Abstract. We study the basic preferential attachment process, which 
generates a sequence of random trees, each obtained from the previous 
one by introducing a new vertex and joining it to one existing vertex, 
chosen with probability proportional to its degree. We investigate the 
number Dt(£) of vertices of each degree £ at each time t, focussing par- 
ticularly on the case where I is a growing function of t. We show that 
Dt(£) is concentrated around its mean, which is approximately 4t/£ 3 , 
for all £ < (t/ log t) -1//3 ; this is best possible up to a logarithmic factor. 



1. Introduction 

In this paper, we study the basic preferential attachment process, which is 
defined as follows. We start with a (small) tree on tq > 1 vertices. At each 
integer time t > tq, a new vertex arrives, and is joined to one existing vertex; 
a vertex is chosen as the other endpoint of the new edge with probability 
proportional to its current degree. Thus, at each time t, we have a tree with 
t vertices. The random tree obtained at any time t is called the preferential 
attachment tree. 

The first appearance of this process can be traced back at least to Yule [25J 
in 1925, and in probability theory the model is sometimes referred to as a 
Yule process. Subsequently, Szymahski [22] studied the preferential attach- 
ment process in the guise of plane- oriented recursive trees. He gave a formula 
for the expected number dt(l) of vertices of degree I at time t, namely 

The structure of such trees was further analysed by Mahmood, Smythe and 
Szymahski [17], and by Mahmood and Smythe [16] . Lu and Feng [12] proved 
a concentration result for the random number Dt(£) of vertices of degree £, 
for fixed I. 

Interest in the model surged after a paper of Barabasi and Albert [2] in 
1999, who proposed preferential attachment as a model of the growth of "web 
graphs", i.e., graphs possessing many of the same properties as "real- world 
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networks" such as the worldwide web. Barabasi and Albert studied not just 
the preferential attachment process as defined above, but also the variant 
where each new vertex chooses some fixed number m > 1 of neighbours. 
For m > 1, the preferential attachment graphs produced are of course not 
trees, but for properties such as the degree sequence, the overall pattern of 
behaviour is the same for any fixed m. 

Preferential attachment graphs were studied formally by Bollobas, Rior- 
dan, Spencer and Tusnady [4], who proved that the degree sequence follows 
a power law with exponent 3, i.e., the expected number d™(£) of vertices of 
degree £ at time t is of order t/£ s , more precisely 

df{£) ~ 2m(m + l) 



+ m){£ + m + 1)(£ + m + 2) 



for all £ < t 1//15 . They also showed that the random number D™(€) of 
vertices of degree I at time t is concentrated within 0(\/t logt) of its expec- 
tation They further indicated how the results could be extended to 
somewhat larger values of £. 

Szymahski [23] gave a more precise estimate for d\{£) = dt{£). Combin- 
ing this with the concentration result of Bollobas, Riordan, Spencer and 
Tusnady [5] shows that Dt(£) is concentrated within a factor (1 + o(l)) of 
its mean for £ up to nearly t 1 / 6 . Szymahski [23] also gave a precise esti- 
mate for the expected number ut(£) of vertices of degree at least £, namely 
2t 

n *W = TTo — TT ^(l) - Janson [11], extending a result of Mahmoud, 

Smythe and Szymahski |17| . proved a central limit theorem for Dt(£) as 
t — > oo, jointly for all £ > 1. 

A much more general model was introduced and studied by Cooper and 
Frieze [6], and Cooper [5]: in the latter paper, Cooper proved a general 
result that implies (weak) concentration for Dt(£) whenever £ < t 1 / 6 / dog 2 1. 

The maximum degree At of the preferential attachment tree is known to 
behave as t 1 / 2 as t — > oo. Mori |19] proved a law of large numbers and 
a central limit theorem for the At: in particular, he showed that Att -1 ^ 2 
converges almost surely to some positive (non-constant) random variable, 
as t — > oo. Further, he showed that the fluctuations of Aft -1 / 2 around the 
limit, scaled by i -1 / 4 , converge in distribution to a normal law. 

Many variants of the preferential attachment process have been studied. 
The limiting proportion of vertices of each fixed degree I has been inves- 
tigated in many different models extending and generalising that of pref- 
erential attachment trees. See for instance, Rudas, Toth and Valko [21] . 
Athreya, Ghosh and Sethuraman [1], Deijfen, van den Esker, van der Hofs- 
tad and Hooghiemstra [8], and Dereich and Morters [9]. See also the survey 
of Bollobas and Riordan [3] for a number of other results on related models. 

Our principal aim in this paper is to prove concentration of measure 
results for D t {£) for all values of £ up to the expected maximum degree. For 
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values of £ above et 1//3 , the expectation of Dt(£) is of order at most 1, and 
all we show is that Dt{£) is, with high probability, at most about logt. For 
values of t at most (tj log t) 1//3 , we shall prove that D t (£) is concentrated 
within about t log t /P of its mean. 

We can write D t {£) = Y?s=l ^( s ' ^> Q> where I(s,t,£) is the indicator of 
the event that the vertex arriving at time s (or, for s < tq, the initial 
vertex labelled s) has degree exactly £ at time t. One would expect that, 
for large t and £ in an appropriate range, most of the variables I(s,t,£) are 
approximately independent of each other, each with mean bounded away 
from 1. This would suggest that the variance of Dt(£) is of the same order 
as its mean, and that D t {£) should be concentrated within about -y/E D t {£) ^ 
y/t/W of its mean. This is indeed the case for constant I: see Janson [11] 
for asymptotic formulae for the covariances Cov(D t (£)/t, D t (j)/t). So our 
concentration result is likely to be best possible up to logarithmic factors. 

Our methods can also be used to prove similar results for the random 
variable Ut(£), the number of vertices of degree at least £ at time t. The ex- 
pectation of Ut(£) is approximately 2t/£ 2 for large t: at the end of this paper, 
we indicate briefly how to adapt our proof to show that Ut(£) is concentrated 
within about \J t log tj£ 2 of its mean as long as £ < t 1 / 2 / log 13/2 t. 

Before stating our results, we specify our model precisely. We start at 
some time ro > 1, with an initial graph G(tq) = (V(tq), E(tq)), with 
|V(to)| = to, l-E^To)) = T o ~~ 1) we think of G(tq) as a tree, although it 
need not be. At each step t > To, a new vertex vt is created, and is joined 
to existing vertices by one new edge, whose other endpoint is chosen by 
preferential attachment, that is, a vertex v is chosen as an endpoint with 
probability proportional to its degree at time t — 1. Note that, if G(tq) is a 
tree, then the graph at all later stages is also a tree. 

Our main theorem concerns the number Dt(£) of vertices of degree exactly 
£ at time t, for all £ > 1 and t > tq. 

Theorem 1.1. Let tq > 4 and tp > lO 5 -^^ — 1 log 3 To be constants. Let 
G(tq) be any graph with tq vertices and tq — 1 edges, and consider the pref- 
erential attachment process with initial graph G(tq) at time tq, and the as- 
sociated Markov chain D = (Dt(£) : t > tq,£ £ N). 
With probability at least 1 — -x, we have 



At 

DM - 



< 120A /^M + 301^1o gW) 



{£+!){£ + 2) 

for all I > 1, and all t > To. 

The parameter ij) is a constant which may be chosen arbitrarily large in 
order to make the probability of failure arbitrarily small; the results are only 
of interest when t is larger than some to(tp). All our results are stated in 
terms of such a parameter (denoted tp or u). 
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We state here an analogous result about the number Ut{£) of vertices of 
degree at least £ at time t, for all £ > 2 and all t > to. (Note that Ut(l) is 
equal to t for each t > to.) 

Theorem 1.2. Let tq > 4 and ip > max (to, 10 5 \/to — 1 log 3 to) be con- 
stants. Let G(tq) be any graph with tq vertices and tq — 1 edges, and consider 
the preferential attachment process with initial graph G(tq) at time tq, and 
the associated Markov chain U = (Ut(£) : t > tq,£ £ N). 
With probability at least 1 — -x, we have 



u t (t) 2/ 



£(£ + 1 

for all £ > 2, and all t > tq. 



< 45 ^Mgg +4xl()Vlog 7 Wi 



We do not give a detailed proof of Theorem 11.21 in this paper, but we 
do give an indication of how to adapt our proof of Theorem 11,11 to give this 
result. The term log 7 (i[)t) appearing above could certainly be improved with 
more work. 

Theorem 11.21 seems to be the first explicit result concerning concentration 
of measure for Ut(£), although some weak concentration can be deduced 
from concentration results for D t {£). Moreover, Talagrand's inequality |24] 
can be applied readily: to demonstrate that Ut{£) > x, a certificate of length 
at most 0(x£) suffices (see for instance |18j for details of the method). This 
method gives concentration for Ut(£) up to about £ = i 1 / 3 , and indeed 
concentration for D t {£) up to about £ = i 1 / 5 . 

For constant values of £, Bollobas, Riordan, Spencer and Tusnady [1] 
showed that Dt(£) is concentrated within about i 1 / 2 of its mean, which 
is best possible; a similar result for Ut(£) follows. For larger values of £, in 
particular where £ is growing as a small power of t, earlier methods (including 
the method based on Talagrand's inequality that we mentioned above) do 
not give the "optimal" concentration of D t (£) or Ut(£) about their respective 
means. Our results above do give what should be optimal concentration, up 
to logarithmic factors, for Dt(£) and Ut(£), whenever the expectations of 
these random variables tend to infinity, again up to logarithmic factors. 

In Section [21 we give an exposition of a method based on exponential 
supermartingales, that is widely used in the analysis of continuous time 
Markov processes. We transfer the method to the discrete time setting, 
and state two theorems that we shall use, and that can be applied in other 
similar contexts. 

In Section [31 we apply our method to describe the evolution of the degree 
of a fixed vertex in the preferential attachment model. We do this partly 
to illustrate the method, but mostly so that we can use the results in later 
sections. We prove a result on the maximum degree At that is weaker than 
Mori's |19j . but simple to prove, in the interests of keeping the paper self- 
contained. 
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Sections [H to [6] are devoted to the proof of Theorem ll.il SectionH]contains 
the main thread of the proof, and we defer some calculations to Sections [5] 
and [6l One difficulty we face is that we cannot get sharp results by working 
directly with the natural martingale associated to the Markov chain D = 
(Dt {£) : t > ro , £ £ N) , so we work instead with a suitable transform of 
that martingale. Proving concentration of measure for the transform is not 
straightforward, so we introduce another Markov process derived from D, 
and apply our methods from Section [2] to that process. 

Section [7] contains a brief sketch of the proof of Theorem 11.21 

In this paper, we deal only with the preferential attachment tree. How- 
ever, our methods will extend to more general settings, and indeed we believe 
we can prove results similar to those above for the general Cooper-Frieze 
model. We intend to address this elsewhere; in a very brief final section, we 
make a few remarks on the difficulties involved in extending our proof to 
other preferential attachment models. 

2. Our method: exponential supermartingales 

The following technique is adapted from a fairly standard method used in 
the analysis of continuous-time random processes; see for instance [7], |13j 
and |15j . We have not been able to find a suitable account in the literature of 
a discrete-time, and time-dependent, version of the method for us to quote, 
so we develop the theory here. We provide results that we hope may prove 
useful in other settings. 

Let X = (Xt : t £ Z + ) be a discrete-time Markov chain, possibly time 
non-homogeneous, with countable state space E and transition matrix Pt = 
(Pt(x,x') : x,x' G E) at time t. (Here and in what follows, our matrices 
- which will normally be infinite - have rows and columns indexed by the 
countable set E.) Let (J~t) be a filtration, and suppose that (Xt) is adapted. 

Let / denote the identity matrix. Further, let us write, for a matrix A, 

(Af)(x)= A(x,x')f(x'). 
x'eE 

Then we see that 

[{P t -I)f]{x)= Y,[Pt{x,x')-I{x,x')]f{x')= ^2P t (x,x')(f(x')-f(x)). 

x'£E x'£E 
Further, note that 

[(P t - I)f](x) = E[f(X t+1 ) - f(x) | X t = x], (2.1) 

that is, [(Pt — I)f](x) is the expected change in / at the t-th step given that 
X t = x. 
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Lemma 2.1. Suppose Xq = xq a.s. Let f : E — > R be a function such that 
E[\f(Xt)\ | Xq = xo] is finite for each t. Then 

t-i 



Ml = f(X t )-f(X )-J2l(Ps-I)f](X s ) 

t-l 

= f(x t ) - f(x ) - ^^> s (x s y)(/(x') - f(x s )) 



s=0 x' 

is an {J- 1) -martingale. 



Proof. The proof for the time homogeneous case can be found in Norris [20] . 
Checking that m{ 
as easy. Consider 



Checking that m[ is a martingale in the time non-homogeneous case is just 



E[M t f +1 \T t ] = E 



t 

f(X t+1 ) - f(X ) - £[(P S ~ I)f](X s ) I F t 

s=0 

E[f(X t+1 ) | X t ] - /(X ) - [{P t - I)f)(X t ) 

t-l 

-Yy(P s -i) f ]{x s ) 

s=0 

t-l 



s=0 



f(x t )-f(x )-Y,[(Ps-i)f\(x s ) 

where we used (|2. 1|) . Also, for each t > 0, 

t-i 

E\M{\ < E\f(X t )\+E\f(X )\ + ^E\[(P s -I)]f(X s )\ 

s=0 
t-l 

< E|/(X t )|+E|/(X )| + ^(E|/(X s )|+E|/(X s+1 )|) 

s=0 

< oo. 



□ 



Lemma 2.2. Suppose Xq = xq a.s. Let f : E —> M + be a function. Then 

t-i 



z * =77^ exp -Z. 



/(*>) ^ ^ /(X s ) 
is an Tt-supermartingale, as long as EZ[ < oo /or a// 1. 
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Proof. Consider 

nz( +l \H = E[/(X W ) 1 X t ]-±- exp ( - £ m ~Pj\ {Xs) 



f{x t ){p t f)(x t ) ( W)(x t ) 



f(X ) f(X t ) V /(X, 



t-1 



X exp(-£- l(P "- /)/K ** 



f(x ) — v /(x t ) ; v /(x, 



xexp(-E 1(Ps " /)/](Xs 



- o /TO 



7 / 



where we have used the fact that ¥.[f(Xt+i) \ Xt] = Ptf, and the fact that 
x < exp(— 1 + x) for all x. □ 



Note that, for a continuous-time Markov chain, the analogue of Z\ in 
Lemma [2.21 is in fact a martingale; see for example Lemma 3.2 in Chapter 4 
of [10]. In the time-continuous case, the matrix {Pt — I) is replaced by the 
generator matrix At of the Markov chain, which is the derivative at time t 
of its transition semigroup Pt- 

We shall show how, under certain conditions, Lemma [2. II and Lemma [2.21 
can be used to prove a law of large numbers for a Markov chain. 

Lemma 2.3. Let g : E — > 1R be a function, and suppose that Xq = xq a.s., 
for some xq E E. For 6 £ R, let 

tf(x, e)=J2 Ps(x, x') (e e ^')-^)) - 1 - 8(g(x') - g(x)j) . 

Then 

t-i 

Z?(8) = exp (eM t 9 -J2v 9 s(Xs,0)) 

s=0 

is an Tt-supermartingale, as long as 1 &Zf{9) < oo for each t. 

Proof. The result is a consequence of Lemma l2.2l with f(x) = e 9 ^ 9 ^~ 9 ^ X0 ^ . 
That lemma tells us that z( is a supermartingale, and we need only verify 
that z( = Zf(9) for this choice of /. 
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The calculation goes as follows: 



7 f f(Xt) ( ^[(iW)/](X s 



exp(%(X t ) - 5 (X ))) 

x exp ^ ffite{X.)-g{X )) 

t-1 



s=0 

t-1 



exp (%(X t ) - 5 (X )) - ^^P^aO^'H**^ " 1] 

s=0 x> 
t-1 

exp (e{g{X t ) - g(X )) - 6^ E n(X s , - <?(X S )) 
t-i 

t- 

exp (flMf-^^pQ, 



s=0 



□ 



Note that, while remains in a 'good' set St of states x where e 8 ^ 9 ^ x '^~ 9 ^ x ^ 
is bounded by some constant (possibly depending on t) over all x' such that 
Pt(x,x') > and all x G St, (i.e., the size of changes in g stays uniformly 
bounded), then the finiteness assumption of Lemma [2 . 3 1 holds . Furthermore, 
we can approximate e e ^ 9 ^ x '^~ 9 ^ Xt ^ using a Taylor expansion. 

In many applications, in particular those in this paper, \g(x') — g(x)\ will 
be uniformly bounded over the entire state space E and over all transition 
matrices Pt- if we work up to some fixed time r, then it suffices to have the 
bound valid for t < r. We assume from now on that, for every r > 0, there 
is some real number J = J(r) such that g satisfies: 

sup sup \g{x') — g(x)\ < J < oo. (2-2) 

S<T,X x > : P 3 ( XtX >)^0 

Now we fix some real number a > 0, and restrict attention to values of 9 
such that 1 6 \ < a. We use the identity 



1 - z = z 2 



f e rz (l-r)dr 

Jr=0 
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to deduce that 

<f%(x,0) = J2P 8 {x,x')6 2 (g(x') -g{x)f /* e r %( a; ')-sW)(i - r ) dr 

< 9 2 TP s (x,x')(g(x')-g(x)) 2 e aJ f (1 - r) dr 

= \e 2 e aJ Y,Ps{*^)(9{x)-g{x)) 2 . 

x> 

Suppose that Xq = xq a.s., for some xo G E, and that we study the chain 
up to some time r > 0. Our aim is to show that remains small over the 
period < t < r. For a precise statement, we need a few more definitions. 

We set 

t 

s=0 x> 

so that 

Z?(8) > exp {0Mf - l -e\ aJ $U(X)) , 

for all 9 with \0\ < a. 

Now let R be a positive real number, and set 

T R = inf{t > : $f (X) > R}. 

Thus, for t < Tr, we have <I ) j_ 1 (X) < R, and therefore 

Z?(0)>exp (eM?-\e 2 e aJ R), 

provided \9\ < a. 

Also, for 5 > 0, we define 



9 

and 



T+(6) = inf{t : Mf > 5}, T~(5) = inf{t : M? < -5}, 



T g (6) = T+(8) A r-(«5) = mf{t : \Mf\ > 5}. 



Lemma 2.4. Fix r > and R > 0, and let g : E ^ be a function 
satisfying (|2.2p /or some Jgl. ^4/so, let a > and 5 > be any constants 
such that 5 < e aJ aR. Then 



{T g (S)<T R Ar)<2e- 52 /^ J \ 



and hence 

IT I ( sup 

In •particular: 
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(i) for any uj < R/J 2 , we obtain the following by choosing a = log 2/ J 
and 5 = V ujR: 

u9\ 



sup \Mf\ > VuoR A (Tr > t\ 

0<t<T J 

< p(t 9 (V^r) <t r at 

< 2e- w / i ; 

(ii) for any uj > R/J 2 , we set a = -j log (2ujJ 2 /R) and 5 = ujJ, and 
obtain 

' sup |Mf | > ojJ) A (T R > rj) < P (T g (uJ) < T R At] < 2e" w/4 . 

0<t<T ' ^ ' 

Proof. Fix any real 9 with |#| < a. For ease of notation, we write T+ for 
T+(<5) and T~ for T 9 - (<f). By Lemma [2T3l (Zf(6)) t >o is a supermartingale. 
On the event {T+ < Tr At}, we have > 5 and 



Zj+(0) > exp (#<5 - Ve Q V) 



E[Z 9 + (9)]<E[Z 9 (9)} = 1. 

1 a 



By optional stopping, 

Hence, using the Markov inequality, 

F(T+<T R Ar) < pfe + (0) > expose- h 2 e aJ R^j 

< exp (-59 + ^9 2 e aJ R^j . 

Optimising in 9, we find that 9 = 5/e aJ R is the best choice, and note that 
\9\ < a. This yields 

P(T+ < Tr At) < exp (~5 2 / 2e aJ R 

An almost identical calculation gives 

P(T~ < Tr A t) < exp (-5 2 /2e aJ R 



and the first part of the result follows. 

The two special cases are obtained by choosing the given values of a and 
5, and verifying that 

■-2 

5 < e aJ aR and r- > % 

2e aJ R ~ 4 

in each case. □ 
We summarise what we have proved in a theorem. 
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Theorem 2.5. Let X = (-X~t)t e g+ be a discrete-time Markov chain, with 
countable state space E and transition matrix Pt at time t, and suppose that 
(X t ) is adapted to a filtration {Ft). Let g : E — > R be any function, r any 
natural number, and J any real number, satisfying 

sup sup \g{x') — g{x)\ < J. 

s<t,x£E x'eE:P s (x,x')>0 

Set 

t 

W=EE Ps{X s ,x'){g{x')-g{X s )) 2 . 

s=0 x'EE 

Let R > be a real number, and set 

T R = m£{t > : $? (X) > R}. 

Then 

t-i 

Mf = g{X t ) - g{X ) -J^Y, Ps{X s ,x>){g{x>) - g{X s )) 

s=0 x'EE 

is an {J- 1)- martingale and, for any uj > 0, 

FN sup \M$ | > max (y/uR, ujj \ A (Tr > t) \ < 2 e - Lu/4 . 

We have demanded that the state space be countable, so that we can 
express our results in terms of sums over the state space. It suffices to assume 
instead that, for any state x £ E, and any time s, there is a countable set 
E{x,s) C E such that YLx'eE Ps{%, x> ) = 1- Indeed, under this assumption, 
if Xq = xq a.s. for some state xq, then there is a countable set E' C E such 
that, a.s., X t G E' for all t > 0. 

We also remark that, in the statement above, we begin our consideration 
of the chain at time 0. When applying Theorem 12.51 in the analysis of the 
preferential attachment tree, we shall instead start at some fixed time tq: of 
course this makes no substantive difference. 

In some instances, for example in Section we will want to bound the 
probability that \Mf\ < 5{t) for all t < r, where 5{t) is a suitable function 
growing with t. One easy way to do this is to apply the above theorem for 
each value t < r, choosing an appropriate value R{t) of R at each time. This 
approach has the drawback that it is necessary to sum the probabilities of 
failure over t < r. Better bounds may be obtained by applying the lemma 
only for a sparse sequence of values t, as we illustrate in the proof of the 
following result. 

The notation here is essentially as for Theorem 12.51 We again have a 
real- valued function g defined on the state space £ of a Markov chain X, 
and the change in g is uniformly bounded by J over all possible transitions 
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of the chain. The function &f (X) is as in Theorem 12,51 Now we have a 
non-decreasing function R : Z + — > M + , and we set 

T R = inf{t > : $f (X) > R(t)}. 

Also, for any non-decreasing function 5 : Z + — > M + , we define an associated 
stopping time 

T g (S) = M{t > : \M$\ > 5(t)}. 
With the notation as above, we have the following result. 



Theorem 2.6. (a) Fix uo > 4, and let 5{t) = max(wJ, 2yJuoR(t — 1)) 
for t > 1. Then, for any r > such that R(t — 1) > ooJ 2 , 



?(T g (8)< // 1 -AD<21og( 8E(r T / ) ), 



,-w/4 

(b) Fix ^> > 4/ J 2 , and suppose that R(t) tends to infinity as t — > oo. 



For t > I, let 5(t) = 2 max (V>Jlog(V></ 2 ), yV^O - l)log R(t - 1) 

V(T a (S) < T R ) < 5e~^/ 4 . 

Proof. For (a), we define a finite sequence of times r%, T2, ■ ■ ■ as follows. Let 
T\ be the first t for which R(t) > wJ 2 : by assumption T\ < r. For j > 1, if 
tj < t then we set 

T j+1 = ia£{t > Tj : R(t) > 4 j R(t j )} A r. 
The final term tm in the sequence is the first Tj with Tj = r, and the number 
N of terms in the sequence is then no greater than 2 + log 4 (^^jt^\ < 

We first apply Lemma l2.4l fii) with r = t\ and i? = R(j\ — 1), noting that 
to > R{t\ — 1)/ J 2 by definition of t\. We obtain that 

f(T g (uJ) < T R A n) < F(T g (uiJ) < Tr^^ A n) < 2e^^. 

As <5(t) > ojJ for all i < ti, this means that, with probability at least 
1 - 2e- w / 4 , |Aff | < <5(t) for all times i < Tr A t\ . 

For each of the times r = Tj (j > 2), we apply Lemma l2,4f i) with R = 
R(Tj — 1), noting now that ui < R(rj — 1)/J 2 by choice of t\. We obtain 
that 

T g (JuB^j - 1)) < T R A T^j < 2e-"/\ 
For each t with r,-_i < i < Tj, we have 



<$(*) > <%-l + 1) > 2^U>R(Tj^ 1 ) > \JuR(Tj - 1), 

since -R(tj — 1) < 4i?(Tj_i) by definition of Tj. We conclude that, for each 
j > 2, with probability at least 1 - 2e~ w / 4 , \M?\ < S(t) for all times t with 

Tj-l < t < T R A Tj'. 
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It now follows that, with probability at least 1 — 2 A^e _aJ//4 , we have \Mf\ < 
5(t) for all times t <Tr At, and part (a) follows. 

The proof of part (b) is very similar in style. This time we let t\ be 
the first t for which R(t) > 2ipJ 2 log(tpJ 2 ). Given Tj, we let Tj+i be the 
minimum t such that R(t) > 4R(r,). The assumption that R(t) tends to 
infinity ensures that we obtain an infinite sequence (jj)j>\ of times. 

We apply Lemma l2T4T ii) with a; = 2ip log(ifjJ 2 ), r = t\, and R = R(ti — 1), 
noting that ui > R{t\ — I)/ J 2 by choice of T\. We obtain that 



(T 9 (2^Jlog(^J 2 )) < T R A n) < 2e 



Since 5(t) > 2tpJlog(ipJ 2 ) for all t, this implies that, with probability at 
least 1 - 2e"^/ 4 , \Mf \ < 5(t) for all times t < T R A r x . 

Now, for each r = Tj (j > 2), we apply Lemma l2.4f i) with R = R(tj — 1) 
and to = tp log i?(rj_i) for each j. We need to check that oj < R{jj — 1)/ J 2 , 
i.e., that R(rj — 1)/ log R(tj-i) > ipJ 2 : we have 

R(t 3 - 1) g(7j_i) 2y>J 2 log(y> J 2 ) 2y>J 2 log(V>J 2 ) _ 2 



log-Rfa-i) " logRirj-x) ~ log(2^J 2 log(VJ 2 )) 21og(^J 2 ) 

as required; here we used the facts that (i) R(jj—Y) > R(rj-i) > 2tpJ 2 log(tpJ 2 ), 
by the definition of the r», and R(-) is increasing, (ii) ipj 2 > 4 and x/logx 
is an increasing function, with minimum value e, for x > e (so in particular 

21og(V>J 2 ) < ^J 2 )- 
We obtain: 

P (r a (j^R{ Tj -l)log Rirj-t)] <T R A Tj ) < 2e -^ R ^l\ 
We have, for Tj-\ < t < Tj, 



S(t) > 5{rj—\) > 2^i?(r 7 _i)logi?(T i _ 1 ) > ^Rfc - 1) log Rfa-x) 

and so, with probability at least 1 - 2e-^ lo § H ^- 1 )/ 4 , \M?\ < 6(t) for all 
times t with Tj_i < t < Tr A Tj. 

Therefore, summing over j, and noting that i?(r,-_i) > 2tpJ 2 log(ipJ 2 )A : >~ 2 > 
e-? -1 for each j > 2, 

oo 

Affl > tf(t) for some i < Tr) < 2e~^/ 4 + 2 e^O'" 1 )/* 
as desired. □ 
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3. Evolution of the degree of a vertex 

For the remainder of the paper, we will use the results of the previous 
section to analyse various aspects of the preferential attachment process. 

Our first, relatively simple, application is to the evolving degree of a single 
vertex; loosely, we prove that, if a vertex has degree k at time s, its degree 
at a later time t is unlikely to be far from k^/t/s. Results of a similar 
flavour are in the literature already (see for instance Cooper [5], Athreya, 
Ghosh and Sethuraman [1] and Dereich and Morters [9]); we give them here 
partly to illustrate our methods and partly because we shall have need of 
the results from this section later on. 

We assume as always that our process starts at some time tq, with tq 
vertices and tq — 1 edges. At each stage, one new vertex and one new incident 
edge are created, so that at each time s > tq there are s vertices and s — 1 
edges. We identify the vertex set at time r with the set [tq] = {1, . . . , r }, 
and label the new vertex arriving at each later time s with s, so that the set 
of vertices present at time t > To is exactly [t] . 

For a vertex v, and t E N with t > max(ro, v), let Xt(v) be the degree of 
vertex v at time t. For tq <t < v, we set X t (v) = 0. 

Set (X t ) = (X t (v) : v = 1,2,...); it is easily seen that (X t ) is a Markov 
process. Indeed, each component X t {y) is separately a Markov process. For 
v G N, we take f v (x) = x(v), the degree of vertex v in state x; the function 
f v is the projection of the state onto its v-th component. In what follows, 
we shall assume that v < to, so that vertex v is present in the graph at time 
to, and we let mo = X T0 (v) be its degree at the initial time. 

We now want to calculate the corresponding martingale from Lemma [2. 11 
First we note that 

E[f v (X t+1 ) - f v (X t ) | X t = x] = [(P t -I)f v ](x) 

x(v) 
2(*-l)" 



This is because the sum of all vertex degrees at time t is 2{t — 1), and 
the probability that a vertex w is chosen as the endpoint of the new edge 
from vertex t at time t, conditional on Xt = x, is proportional to its degree 
x(w), and therefore the conditional probability that vertex v is chosen is 
x{v)/2{t-l). 

By Lemma |2. 11 we know that the process M{v) given by 



t-i 



M t (v) = f v (X t )-f v (X T0 )-Y,[(Ps-I)fv](X s 



S = T 



t-1 



S=TQ 



X s (v) 
2(s - 1) 
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is a martingale. We re-write the above as 

X t (v) = M t (v) +m +Y, ^TTTTT- 

Let Xf solve the recurrence relation 
x t+1 = x t ( 1 + 



' ' X s (v) 
2(s-l)' 

S=T V ' 



2(t - 1) 



for t > To, with x T0 = mo- A simple induction argument shows that, for all 
t > t , 



, t- 1 / t - 2 

r^/-— I < at < mo y — g. 

Provided tq > 4, we have 



'- 2 <, lii 1+ > <?''-' 



ro-2-Vr -lV 2(r -2)y " 4V ro-1 
and so 



5 / t- 1 . , 

zt < T m 0\/ r- (3.2 

4 V t - 1 



Now fix any oj > 4. For a vertex v, we define the time 



V;. - inf'<{ / > r : A', ( r) > «»0w 3 m 0A /^ y 



Then for tq < t < T v , we have Xt('u) < 60a; tuq y _ t . 

Let Et = Xt(v) — Xf, with E TQ = 0; we want to bound \Et\. Substituting 
Xt(v) = xt + Et in (|3.ip . and using the recurrence 

t-l 

EX g 
2(s -iy 

S=T V ' 



we obtain that, for tq < t, 

\E t \ < \M t (v)\ + V '^ s| - . 
tv ;| 4^ 2(s- 1) 



125, 



S = TQ 



For To < s < T„, X s+ i(t>) — X s (v) is either or 1, and the probability 
that it is equal to 1, conditional on X s , is 



^„ n 3 
< 3(krm 



2( s _i) - U V (s-l)(r -l)' 
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So, for t < T v , we have 



S = T() x' 

< 30w 3 m 01 

S = TQ 



( s -l)(r -l) 



< 60w 3 m 



70-1 



We now apply Theorem l2.6f b) to the function /„, with R(s) = 60w 3 moy _ , 
J = 1, and with if) = u). Thus we have 

tf(t) = 2 max fwlogo;,y60u; 4 molog (60£J 3 m \/(* ~ ~ 1)) (t~~t) 



for ro < i. The result implies that, with probability at least 1 — Se - ^/ 4 , we 
have \M t (v)\ < 5(t) for all t with r < t < T v . 

We thus obtain that, with probability at least 1 — 5e~ w / 4 , for tq < t < T v : 

\ Et \ - 2 2 (f- l l) +8a;2 ^Q^l) 7 \/ log (60^ moA /(t - l)/(r - i] 



S=TQ 



Now log(xy) — y 1 / 3 logx is decreasing in y for y > 1 when x > 20, and is 
zero at y = 1: we apply this with cc = 60w 3 mo and y = y/(t — 1)/(tq — 1), 
to obtain that 

, ; v / + _ ! \ 1/6 

log (60w 3 m \/(* " 1)/(t-o " 1) j < y) log(60a; 3 mo) 



2/ t-l 



1/6 

< — fij l°g(2m ), 

for any uiq > 1, and so 

11 \E I / i — 1 \ X ^ 3 

1^1 - 12 2(1 - i) +8^V"^olog(2m ) ( — j j • 

We analyse the recurrence above using the following simple lemma. 

Lemma 3.1. Let A be a positive constant, andr^ a positive integer. Suppose 
the sequence et, for t > To, satisfies e T0 = and 



_ o 2( S -l) V-o-1 
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for all t > to. Then 

e t < e * t = 6A 

for all t> tq. 



t - 1 



t - 1 3 V t - 1 



t - 1 



1/3' 



Of course, the conclusion that we shall use is that et < ^>^\J but the 
bound above is easier to establish by induction. 

Proof. The proof is by induction on t, the result being true with something 
to spare for t = tq. 

Suppose the result is true for all s with tq < s < t. Then, by the induction 
hypothesis and the recursive bound, we have: 

t-i 



S = T 



2(s - 1) 



+ A 



t-1 
ro-1 



1/3 



t-1 

^ s - 1 



S=TQ 



s-l 2 / s- 1 



1/3' 



t - 1 3 Vro - 1 

s-l 2(s-lV /3 



+ A 



t - 1 



1/3 



Now the function q(s) = — l -r K , , , . 

3 v ' S—l \l Tq — 1 3 \ To — 1 

s > To, so g(s) < 1/3to for all s > To, and we have 



ro -I, 
is decreasing for all 



S = T() 



3(r - i; 
1 

3(ro - i; 



+ 



+ 2 



s-l 
ro-1 

t-1 
ro-1 



1/2 



1/2 



This gives 

e t < 



A 



< 6A 



+ 6A 
(ro - 1) 

t _ 1 \ V2 



t-1 
T - 1 



1/2 



5A 



s-l 
ro-1 

t-1 
ro-1 



t-1 

T - 1 



1/3' 



TO 



1/3' 



1/3 



,To-l, 

This is the desired inequality for et- 



4A 



t - 1 
ro-1 



1/3 



□ 



We can now deduce that, with probability at least 1 — 5e W//4 , for To < 
t<T v , 



\Et\ < 48wV m log(2m )W- -, 

V T - 1 



(3.3) 
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and this bound is at most A8uj 3 mo\ . Combined with (13.21) . this 

V r - 1 

implies that Xt(v) < 50uj 3 mo^ ^ for all times t <T V . We deduce that 

T v = oo, since otherwise this would contradict the definition of T v . This 
means that, with probability at least 1 — Se - ^/ 4 , the bound (|3.3|) is valid for 
all times t > tq. 

We thus have the following theorem. 

Theorem 3.2. For all u> > 4, tq > 4, and mo > 1, we have 



\X t (v) - x t \ < A8oj 3 ^m log(2m )^ ^_ for all t > r X ro (t?) = m ^ 

> i _ e -5w/4 j 

and therefore 



Xt(v) — 50uj 3 mQ\ /or a// i > ro X T0 (f) = mo 

V r - 1 

> 1 _ e -5^/4_ 

We note two consequences of the result above that we shall use later. 
Corollary 3.3. For to > 4, uj > 4 and mo > 10 5 cj 7 ; we have 

X t (v) < 2m ^^-—J f° r al1 1 ^ T o ^ro(v) = ro j > 1 - e" 5tj/4 . 



Proof. By (|3.2f) . we have xt < j m oy TQ -i for all t > tq, given the initial 
condition x T0 = mo- 

The result will then follow from Theorem 13.21 as long as 



48cA/ 1Og(2m0) < I 
V m 4 



We see that 



log(2m ) ~ log(2 x 10 5 u; 7 ~ log (2 x 10 5 x 4 7 ) 
which implies the desired inequality. □ 

We shall also use the following result, stating that the maximum degree 
at time t is unlikely to be larger than ip\ft — 1, where ip is a large constant. 

Theorem 3.4. Let to > 4 and ^ > 10 5 \/ r o ~~ 1 l°g 3 r o ^ e constants. For 
the preferential attachment model, with any initial graph on to vertices and 
tq — 1 edges, 



' (X t (v) > ipyt — 1 for some vertex v and some t > tq) 
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Proof. Let Pi be the probability that Xt(v) > ijiy/t — 1 for some t > To and 
some vertex v already present at time ro, and P2 be the probability that 
Xt(v) > ifty/t — 1 for some t > tq and some vertex v arriving at a time later 
than to. 

We begin by bounding P\. For a fixed vertex v present at time ro, its 
degree at that time is certainly at most ro — 1. We apply Theorem 13.21 with 
m Q = To _ 1, and u = (^/50)i/ 3 (r - 1)~ 1/6 > 4, so that 



50u 3 m J- < 50o; 3 v /; T^ r Iv / t- 1 = ipy/t - 1. 

V T - 1 

We obtain that 

v 1/3 



P (X t (v) > ipy/t-1 for some t > r ) < e~ 5w/4 < exp 
We therefore have 

Pi < t exp 



3(r - 1)1/6 



^1/3 



3(td- i)V6 y 

We now bound P^. For each time s > t$, consider the new vertex v of 
degree 1 born at time s. We apply Theorem [321 to this vertex, with ttt-o = 1, 

-1 In j 

ro replaced by s, and ui = [tp\/s — 1/50) , so 50u; 3 mo-i/|5T < — 1, 
and we have 

P (X t (v) > TpVt^l for some t > s) < e~ 5w/4 < exp 7 ^ ~ ^ 7 j . 

Summing over s, we have 

The overall probability that there is, at any time t, a vertex of degree at 
least ip\/t — 1, is thus at most Pi + P2 < 2Pi, as claimed. 

For the final inequality, we need to show that our bounds imply that 

/(V) = V 1/3 - 3(r - 1) 1/6 log(2roVO > 0. 

We note that f'(ip) = 3^ [if) 1 / 3 — 9(r — l) 1 / 6 ] > 0, so it is enough to verify 
that the inequality holds at tp = W 5 ^tq — 1 log 3 ro, at which point 2i(j < 
Tq 2 for all ro > 4. The desired inequality is equivalent to IO 5 / 3 log(ro) > 
31og(2r # this holds since IO 5 / 3 > 39. □ 
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4. Concentration for D t {£) 

In this section, we again consider the basic preferential attachment model, 
but now we are concerned with the number of vertices of degree exactly £ 
at time t. 

Recall that, for t > tq and £ G N, D t {£) denotes the number of vertices of 
degree exactly £ at time t. It is easy to see that D = (D t (£) : t > to, £ G N) 
is a Markov chain. 

We recall our main theorem. 

Theorem 11.11 Let tq > A and ip > 10 5 \/to — 1 log 3 To be constants. Let 
G(tq) be any graph with To vertices and tq — 1 edges, and consider the pref- 
erential attachment process with initial graph G(tq) at time tq, and the as- 
sociated Markov chain D = (D t (£) : t > tq,£ G N). 
With probability at least 1 — ^, we have 



At 

Dt{£) - 



{£+!){£ + 2) 
for all £ > 1, and all t > tq. 



< 12 ( .\/^) + 301^1og W , 



As is well-known (and as we shall show shortly), the expectation of Dt{£) 
is very close to At/£(£ + 1)(£ + 2), for all £ > 1 and all t > r , so the 
theorem shows concentration of measure of these random variables about 
their means. 

For £ < (t/ log(V't)) 1//3 /V ;2 ) the bound on the deviation of Dt{£) is at most 
ilog(^t) 



125y , which is, up to the log factor, on the order of yTE D t {£). 

We get concentration within a factor (1 + o(l)) of the mean as long as 
£ = o(t/logt) 1 /3. 

For all values of £ larger than {t/ log t) 1//3 , the bound on the deviation 
that we obtain is of order log t. This result might conceivably be of interest 
for values of £ between about (t/ log t) 1 ^ 3 and t 1//2 , but for larger values of £ 
we already have a stronger result: Theorem 13.41 tells us that Dt{£) = when 
£ is larger than i\)\ft, with probability at least 1 — 1/tp. 

The proof of Theorem 11.11 takes up the rest of this section, although we 
defer the bulk of the calculations until later sections. 

Proof. It will shortly turn out to be convenient to truncate the range of £, 
so that we consider only values of £ with 1 < £ < £q, for some fixed £q. We 
remark now that we may freely do this, as we are proving an explicit bound 
on the probability of failure that is independent of £q. 

For the moment though, we consider all values £ G N simultaneously, and 
consider the evolution of the entire process D = {Dt{£)) for t > tq. 

We have, for t > To + 1, 

E[A(i)-A-i(i)|A-i] = i-|^. 
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Also, for I > 2, 

HAM - A-.M I A-J = (< - 1)A -' ( f- 1) - "tlW. 

2(i- 1) 2(t - 1) 

Then, by Lemma |2.1|, 

A(l) = A (l)+E (l-^)+M t (l) 



=T0 
t-1 



2s 2s 

s=-r 

where Mt{£) is a martingale for each I > 1. 

We want to show that, for £ > 1, Dt(£) is close to dt{£), where the c^(£) 
satisfy <i To (£) = D To {£) for all £, and: 

= ^ (i) + E(i (UV 



2s 

S=TQ 



* m = + (f > 2) . 



2s 2s 

S=T1D 



Given the initial values d T0 (£) = D TQ (£), for £ > 1, the equations above are 
equivalent to: 

dt(l) = l + df_i(l)fl ' 



2(t - 1; 



*W = (^ 2 )- 

These equations are known to admit the explicit solution 

At 



dt(£) 



l)(£ + 2)' 



if the initial conditions correspond (which of course cannot happen for a 
concrete graph at time to, since then all the D TQ {£) are natural numbers). 
More generally, we have the following result, which is very similar to results 
of Szymahski [22, 23J and Bollobas, Riordan, Spencer and Tusnady [4]. 

Lemma 4.1. Take any tq > 1, and any sequence (D T0 (£))i>i of non- 
negative integers with Yli>i D^ity = tq. Then the solution dt{£) of the 
equations above with d TQ {£) = D To (£) for all £ satisfies 



d t (£) 



At 



£{£ + !){£ + 2) 
for all I > 1 and all t > tq. 



3/2 
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Proof. Set 

At 

zt(£) = dt(£) - 



£{£ + !){£ + 2)' 
for all t > tq and £ > 1. 

Note first that D TQ {£) < to, for all £, and so also |-z ro (^)| < r o- Thus the 
lemma holds for t = tq. 

For each t > To, it is straightforward to verify that 

1 



and, for £ > 1, 

- 1 



2t(£) = (*) 1 " 7777 77" + Z t -l{£ ~ 1 



2(t - 1) y v 2(t-i)' 

If Z is a common upper bound on |z t _i(£)| and \zt-\(£ — 1)|, we deduce that 

1*01 < Z (l " 7777^-77 + 77^1 = Z f 1 1 



2(t-l) 2(t — 1) y V 2 (t-l} 
By induction, it now follows that 

*-i / 1 \ / — 3/2 

u=ro v ' 

for all i > To and every I > 1, as claimed. □ 

Define -E^) = -Dt(^) — dt(^), where, as above, we set d T0 (£) = D T0 (£) for 
all £ > 1. Note that D T0 (£) is an integer- valued random variable, determined 
by the graph at the initial time To. Note also that E T0 (£) = for all £ > 1. 
For the moment, we shall keep the term E TQ in our expressions, to show how 
the calculation would be affected in a setting where E TQ is not necessarily 
zero. 

For t > To + 1, we have 
E t (l) = £ T0 (l)-gM2+M t (l) 



S=T0 
t-1 



S=T0 

This means that, for i > To + 1, 

E t {\) = E t _ 1 (l)(l-277^ I y)+M t (l)-M t _ 1 (l) 
and, for £ > 2, 

W) = - ^) + if - - " + M,«) - M,- l(< ). 
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At this point, we truncate the process D: we fix some £q > 1, and set 
D e ° = (D e t ° (£) : t € Z+, I = 1, . . . , £ ) - in other words, we restrict attention 
to the numbers of vertices with degrees at most £q. The truncated process 
remains Markov, since the distribution of D t (£) conditioned on D t -\ 
depends only on D t -\(£) and D t -i(£ — 1), for each £ < £q. 

Once we have fixed £q, we may restate the previous system of equations 
as a matrix equation, giving a recurrence for 



Et. 



yE t (£o)J 



We have, for t > tq + 1, 



Et = At-iEt-x + AM t , 



where 



AM t (l) 



with AM t {£) 
is given by 



K AM t (£o) 

M t (£) — M t -\{£) for each £, and, for s > tq, the matrix 



n-i 

2s 





1 - 

1 2s 







2s 








1 - ; 



Hence it follows that, for t > To + 1, 
/ t-i 



E, 




AM, 



VS=r 



Here and subsequently, the notation 11$=^ ^s indicates the matrix product 
A t -i • • • A TQ , with the indices taken in decreasing order. At this point, we 



recall that E, 



TO 



0, so that 



't-i 



AM, 



S=T0 + 1 \U=S / 

We shall control the deviations of Et, although this process is not itself 
a martingale, and so we cannot directly apply our martingale deviation 
inequalities. The process (Et) is a transform of the martingale (M t ), in that 
it is a sum of its differences, multiplied by the appropriate n«=s w hich 
depend on t. In order to get around this difficulty, we now introduce, for 
each r > ro, a martingale M T stopped at r, whose value at r is the quantity 
E T of interest. 
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We fix r > To and define, for t < r, 

s=ro+l \u=s / 

and Ml = M£ for t > r, then it is easily checked that M r = (M t T ) is 
a martingale, and that = E T . Thus we can obtain bounds on E T by 
studying the martingale M T . 

From Lemma 14.11 we have that, for every t > tq and every £ = 1, . . . ,£q, 

D T (£) = d T {l) + M T T {£) < J + To + M T T {t). 

We now consider the transitions of the truncated process D °, with state 
space (1 1 + Y°. Recall that each transition involves the creation of one new 
vertex of degree 1, and the increase of a degree of an existing vertex by 1. 
This means that a transition of the truncated process involves an increase 
of 1 in D e °(l), and either: (i) a decrease of 1 in D £ °(k) and an increase of 1 
in D £ "(k + 1), for some k £ {1, . . . , £ - 1}, (ii) a decrease of 1 in D io (£ ), 
or (hi) no further change. In other words, the vector D^° +1 is obtained from 
D e s ° by adding one of the following vectors: 

(i) Vk = ei - e k + ejt+i, for some k G {1, . . . , £ - 1}, 

(ii) y to = ei - ee Q , 
(hi) y = ei. 

Here ej denotes the standard basis vector in with a 1 in the jth. coor- 
dinate and Os elsewhere: here and in what follows, we abuse notation by 
suppressing the dependence on £q. The transition probabilities are then 
given by 

P s (D s ,D s +y k ) = (k = l,...,£ ) 

P s (D s ,D s+yo ) = 

k=l S 

Here too we have removed the superscripts £q for clarity. 
We can write, for s > t , 

to 

AM S+1 = D s+l - D s -Y,Ps{D s ,D s + y k )y k 

k=0 

to 

= ^yk[lD 3+1 -D 3 = Vk - P s (D s ,D s + y k )]. 

k=0 

We consider running the process up to some fixed t > t : all our nota- 
tion should specify the dependence on t, but again where possible we shall 
suppress this. 
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For ro < s < t, we define B s = £?J = riu=s+l so * na t 

s-l 

M T S = B W AM W+1 . 

W=TQ 

We then have, for To < s < t, 

AMJ +1 = MJ +1 -MJ = AAM S+1 = S s ^y fc [I Da+1 _ Ds=2/fc -P s (L> s ,£) s +^)]. 

k=0 

Now we define Z? = D e ° by 
A = J2B s J2ykPs{D s ,D s + y k ) 

s=t k=0 
t-1 io 

+ J2 B ^yki I Ds + i-D a = yh -Ps{D s ,D s +y k )}, (t<r), 

S=T0 fc=0 

= £ £ p, ( a, a + f *) [^-yfc] + M t r • 

s=r fc=0 

so that, for t < t, 

t-1 i 

A = Z ^E^+i- '^ 

s=ro fc=0 
t-1 

= ^A(A+i-A) 

s=ro 

and so 

A+i - A = A(A+i - A). 

The process -D is not in general a Markov process. However, we may 
define a process Y = Y T by setting Y t = (A>A) for i < r, and = y r 
for t > t. This extended process 1" is Markovian, with state space E = 
(j+ya x Q]J+)*o_ At each time t with To < t < r, the one-step transition 

matrix Pt for Y is derived from that of D. Specifically, if A+i — A = Uk, 
then A+i - A = Ayfc, and Yt+i - 1* = (y k ,B t yk)- 

Our plan is to apply Theorem 12.51 to the Markov process Y, and, for 
£ = l,...,£o, to the projection function g = g l taking (x,x) G E 1 to x(^). 
For each fc = 0, . . . ,t , if A+i - A = l/k, then g(Y t+1 ) - g(Y t ) = [B t y k ]{£), 
the £-th entry of the vector B t y k . Since A is a product of non- negative sub- 
stochastic matrices, it too is non-negative and sub-stochastic. The vector y k 
has all its entries in {0, +1, —1}, with at most two positive and one negative 
entries, so each co-ordinate of the vector Bty k is a sum of at most two entries 
of Bt, minus at most one other entry. Therefore | [Az/fc] (^) I — 1 f° r au ^ 
and £. So, in applying Theorem 12. 5} we may take J = 1. 
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For £ = 1, . . . , £q, and tq < t < r, we therefore have 



t 



S = T() X 1 

t i 



s=tq k=0 

For brevity, we set <&l(Y) = $*f (Y) from now on. 
For each £ = 1, . . . , £q, we set 

R l = 1600^ + (10V) 4 logOK), 

and, for £ = 1, ... , £q, we set 

r| = inf{t > r : *f (Y) > tf}. 

We now apply Theorem 12.51 with J = 1, R = i? , and u = 9log(tpT), noting 
that ujJ 2 < R £ ; we obtain that, for each r > To and £ = 1,... ,£q, 

sup |M^)I > sJlog^r)^) A (T! > r)) < 

Let Ta = inf{s > tq : D s (k) > for some /c > s — 1}, the first time s 
such that there is a vertex of degree greater than ip^ s — 1: by Theorem [331 
P(f A < oo) < 1/^. Also, for each k = l,...,£ , let 

f h = inf {s > r : D s {k) > 5p + 400V 2 log(#)} • 

Note that, for t < s < mm((k/ip) 2 + l,f A ), £> s (» = 0, and so f k > 
min ((fc/^) 2 , Ta) for each /c = 1, ... ,£q. Finally, let T be the minimum of 
Ti , . . . , Ti . 

In the next section, we shall prove the following result. 
Lemma 4.2. For all t < f A f A , and all £ = !,... ,£ , *f-i00 < R*- 
This result can be restated as saying that Tjj, > T A Ta for each ^ = 

Now, for each t > tq and each £ = 1, . . . ,£q, set 



<y T (i) = 120^ Tl °^ r) + 300^ 2 log(Vr), 

which is slightly less than the bound on the deviation appearing in the 
statement of the theorem. Observe that, for t > tq and £ = 1,...,£q, 

6 T (£) 2 > 14400 rl °gj^ r ) + 9 x no,/,) 4 log 2 (Vr) = 91og(Vr)^. 

Thus we have 



5 T {£) > 3yJ]og^T)R e , 
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and so, using Lemma B~2l 



sup \M[(£)\ > &r(£) A (T A T A > r) 

TQ<t<T J 

< f(( sup \Ml(l)\>5 T (e)) /\(T e R >r) 

\ \T <t<T J 



2 

< 



P ( (\E T (£)\ > 5 T {£)) A (T A T A > r) ) < -|^, (4.1) 



Recall that = .E T for each r > to, so we now deduce that 

2 

l/j 2 T 2 ' 

for all t > tq and £ = 1, . . . , £$. 

We now wish to bound the total probability that there is some pair (r, £), 
with t > t and £ = 1, ...,£ , such that \E T (£)\ > 5 T {£) and r < f A T A : 
recall that we want a bound independent of £q. 

For those pairs (t,£) with £ < V'V 7 ' ~~ 1) we sum the bounds from (|4.ip . 
and obtain that 

|£ T (f)| >^))A(fAT A >r) 

for some r > To, 1 < £ < min(^o, 4>Vt — 1) 

2 ~ Vv 7 ^! 



< 



00 

y 

ib 2 ^— ' t 2 

T=TQ 



OO 



< 1 / r -3/2 dr 

3 

< — . 

For those pairs (r, with £ > VV' r — 1) we have either T A < r or D T {£) = 
0, and in the latter case we have 

\E T {£)\ = d T {£) <lj + ro<^ + i + 10V 2 < St(£). 

Therefore, with probability at least 1 — we have, for all £ = 1, . . . ,£q 
and all r > r , that either \E T (£)\ < 5 T (£) or r > f A f A . 

We now set T/ = inf{s > r : > S s (£)} for each £ = 1, . . . ,£ , and 

T* = min(T;, £ = 1, ... ,£ ); we obtain that 

P(T* < oo and T* < f A f A ) < 3/V>. (4.2) 
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On the other hand, if Ti < for some I = 1, ... ,£q, there is an s > t$ 
with D s (i) > 5s /£ 3 + AOOip 2 log(^s) and 

D.(£) < ^ + r + 5 s (i) 



< 



£ + r + 120^§^ + 3000" log(#) 



+ 36001og(V's)) + 30(h/> 2 log(V's) 



< 5-^ + 350^ log (#), 

which is a contradiction. 

We conclude that T£ < Ti for all I = 1, . . . , £q. Recalling that T is the 
minimum of T\, . . . , T^ and T* is the minimum of T*, . . . , T% , this implies 
that T* < f. Equation jOJ now implies that P(T* < oo and T* < T A ) < 
However, we also have that P(T A < oo) < l/ip, so P(T* < oo) < A/ip. 

This conclusion is equivalent to the statement that 

P (\E S (£)\ < 5 S {£) for all£ > 1 and s > r ) > 1 - -. 



This implies the result stated, since 

4s 



D s {£) 



£{£ + !)(£ + 2) 



<|£sWI+t <|£U£)|+V 2 - 



□ 



5. Bounds for <& l T _ x (Y) 
Our aim in this section is to prove Lemma 14.21 which states that 

r-l £ 

S=T k = 

is at most 

^ = 1600^ + (10^) 4 log(^r), 

whenever r < r < f A T A and 1 < ^ < £ . 

Recall that, for s < r < T A T A , and 1 < k < £q, we have 

D(k)<l° k>ip^=l 
\ 5 7? + 400^ 2 log(V's) fc^^/a 3 !* 

For this section, we may and shall assume that we do indeed have these 
bounds on the values of D s (k). 
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Recall also that, for s = to, . . . , r— 1, B s is the matrix product A T -i • • • A s+ i, 
and that 

'e x k = 

Uk = \ ex - e k + e k+ i 1 < k < £ 
ei-e io k = £ . 

We may now write, for 1 < £ < £q, and tq < s < r, 

[B s y }(£) = [B sei ](£) 

= B s (£,l) 

[B s y k }(£) = \B s e 1 ]{£)-[B s e k ]{£) + [B s e k+1 ]{£) 

= B s {£, 1) - B s (£, k) + B s (£, k + 1) (1 < k < £ ) 

[B s y eo }(£) = [B sei }(£) - [B s e £o }(£) 

= B s (£,l)-B s (£,£ ). 

where [B s ](i,j) denotes the (i, j)-entry of the matrix B s . We then have 

[B s y }(£) 2 = B s (£,l) 2 

[BsVk]{tf < 2B s (e,l) 2 + 2(B s {£,k)-B s {£,k + l)) 2 (1 < k < £ ) 
[B s y eo ](£) 2 < 2B S (£,1) 2 + 2B S (£,£ ) 2 . 

Provided we interpret B s (£o,£q + 1) as equal to zero, we can now bound 
the sum over k, for any s and any £ < £q, as 

to 

Y,Ps{D s ,D s + y k )([B s y k ]{£)) 2 

k=0 

< P s (D s ,D s + y )B s (£,l) 2 

to 

+ 2 Ps(D s , D s + y k ) [B s (£, l) 2 + (B s (£, k) - B s (£, k + l)) 2 ] 
k=i 

to 

< 2B S (£, l) 2 + 2 Ps(D s ,D s + y k )(B s (£, k) - B s (£, k + l)) 2 . 



k=i 



For 1 < £ < £q, all terms in the sum with k > £ are zero, since the matrix 
B s is lower-triangular, and therefore we have 

r-l r-1 t 

K-i(Y) < 2 J2 B si^ l) 2 +2 J2 J2 P °( D °i D s +y k )(B s (£, k)-B s (£, k+l)) 2 . 

S=TO S = T() k = l 

The key task is thus to estimate the entries B s (£, k) of the matrix product 
B s = A T _i ■ ■ ■ A s+ i, and in particular the differences \B S (£, k) — B s (£, fe + l)|. 
The recurrence satisfied by these matrix entries is that, for < j < £: 

B s ^(£,£-j) = [B s A s ](£,£-j) 
= B s (£, £ - j)A s {£ -j,£- j) + B s (£, £-j + 1)A S {£ -j + lj-j), 
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since the only non-zero entries of A s in column {£ — j) are those in rows 
(£ — j) and {£ — j + 1). Substituting for the values of these entries yields 

B s ^(£,£-j) = B s (£,£-j) (l - £ —A + B a {£,£- j + 1/ ' 



2s J y 1 J '2s 
For notational convenience, we fix t > 1 and write 

a j (s) = af\s) = B s (£,£-j), 

for s = to, . . . , t — 1 and j = —1, 0, ...,£ — 1. 
Rewriting in terms of the aj(s) gives: 

r-1 r-1 i 

< 2 ae-i(s) 2 + 2 £ ^ P S (L> S , D s + y fc )( a ,_ fe ( S ) -a^^s)) 2 . 

S = T0 S=TQ k = l 

(5.2) 

The transition probabilities P S (D S ,D S + y^) can be expressed explicitly as 
2(s-l) 

The recurrence satisfied by the aj(s) is then: 

a j( s ~ 1) = + f 1 " ^2^J a i( s )> 

for < j < £ — 1, and tq < s < r — 1. We also have -B r -i = /, the identity 
matrix, so that ao(r — 1) = 1, and aj(r — 1) = for j > 0. Note also that 
a„i(s) = for all s, since the matrix B s is lower-triangular. These boundary 
conditions, together with the recurrence relation, suffice to determine all the 
values cij(s). 

There is a natural interpretation of the term aj(s): it is the probability 
that a fixed vertex v with degree i — j at time s will have degree £ at 
time t — 1. This can most easily be seen by checking that this system of 
probabilities satisfies the boundary conditions and the recurrence relation. 
In the notation of Section [3l 

aj {s) = F(X T ^(v) = I | X s (v) = 1-3). 

One immediate consequence is that < aj(s) < 1 for all j and s. 

It may be of interest to note that there is a formula for the Oj (s) as an 
alternating sum: 

n (i-i)- 

v J 7 i=e-j v 7 «=s+i v 7 

One may verify that this formula satisfies the recurrence. It can also be 
obtained by observing that the matrices A s can be simultaneously diago- 
nalised, leading to a formula for the matrix B s . We also obtain 

v 7 i=fc v 7 u=s+l v 
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Although these formulae are quite appealing, we have been unable to extract 
useful bounds from them. 

At this point, we break into three cases. The main case of interest is when 
8 < £ < 2V>v / "7 7Tr % but we also need to deal with values of £ outside this 
range, and we do this first. 

For £ < 7, all we have to do is note that, from (|5.2|) . 

Now suppose that £ > 2ip\/r — 1. By assumption, whenever k > — 1, 
we have D s (k) = and so P S (D S , D s + y k ) = 0, and such terms contribute 
nothing to the double sum. We now need to bound the contribution of terms 
where k < ipy/s— 1 and £ > 2ip\/r — 1. To do this, we use the inequality 
(a£-k{s) — ae-k-i(s)) 2 < ae-k(s) 2 + ae-k-i(s) 2 , and bound the size of any 
term ae- k (s) subject to the given conditions. For this, we observe that 

ai-k(s) < P(Xr_i(w) >£\ X s (v) = k) 

< P(X T ^ 1 (v)>2ijV^l\X s (v)= [^v^TJ). 



We now apply Corollary[3]3j with to replaced by s, mo replaced by [tpy's — lj , 
and uj replaced by (s — 1) 1//14 . We have [ipV s ~ lj > 10 5 (s — 1) 7//14 , since 
i> > 2 x 10 5 . We also have (s - 1) 1/14 > 4 provided s > 2 28 . So, for s > 2 28 , 
we have 

ae-k(s) < e * (s ^ . 

Thus, for each s > 2 28 , 
LVv^TJ 

Ps{D s , D s + y k )a^ k {sf < e^" 1 ) 1 

k=l 

For s < 2 28 , we have 

Ps{D s ,D s + y k )a^ k {s) 2 < 1. 

fc=i 



U/X4 



Therefore 



T-l 
S=T Q 



Of-i(s) 2 + 2 P s (D s ,D s + y k )ae-k(s) 

k=l 



< 6 



V2S 



s =228_|_l 



< 2 x 10 y , 



which, since ip >2x 10 5 , is at most V' 2 - This comfortably gives the required 
result in the case where £ > 2ifj^/r — 1. 



For the remainder of this section, we assume that 8 < £ < 2^y/T^T. 
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Although our exact expression for the a* (s) proved difficult to work with, 
we now give a function fj(s) which has a simple form, and which satisfies 
the boundary conditions and an approximate version of the recurrence; our 
plan is to show that a,j(s) is close to fj(s) for all values of j and s. 

For < j < £ - 1 and < s < r - 1, set 



/iW = (V)c 



3 I " 1-3 



j J V V T — 1/ \ T — 1 

Throughout what follows, we shall set v = v s = y/s/(r — 1), so 

f J ( s ) = ( i ~. 1 )(i-vyv e - j . 



j 

We note that v T -\ = 1, and so fj(r — 1) = for j ^ 0, while /o(r — 1) = 
1. We could formally define the function /_i to be identically 0: the key 

(I — 1\ j 

identity we use for the binomial coefficients is = 7 1 

V? - V V 3 Jl-3 
which indeed entails (^Ti) = 0- However, we find ourselves having to deal 
with the case j = as a boundary case separately anyway, and so we need 
make no (further) explicit mention of the case j = —1. 
We claim that 

fib - 1) = + (1 - l -^f) m + [fj(s - 1) - f^a) + f 3 ( S ) 

for all j > 1 and 1 < s < r— 1. Our aim will then be to show that the term in 
square brackets is usually small, and that this is thus a good approximation 
to the recurrence satisfied by the Oj(s). Rearranging the claimed identity, 
we see that it is equivalent to 



3 

To verify this identity, we write 



/K*) = ^(/;(s)-/i-iO0)- (5-3) 



j *) Y s {1 ~ ^~ V ~ J_1 w - j)o- - v ) - » ( 5 - 4 ) 



.(1 _ v y-^-3 r- \(i-v) 



2s V V 3 J £-J\ 3 

3 (ft- Am j j (' ' ^ j 1 „y 1,.' 



2s VV 3 J VJ-1 



l_ v y v z-3_[~ J(i- v )J-V 
(/iW-Zi-iW). 



2s 

Equation (|5.3p demonstrates that the /j(s) are the analogues to the Oj(s) 
for a continuous time version of the preferential attachment process. In 
this continuous time version, at time s, each vertex of degree k attracts a 
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new edge (whose other endpoint is a new vertex of degree 1) at rate k/2s, 
independent of the degrees of other vertices. The degree of a given vertex 
is then a pure birth process with this transition rate. The probability that 
a vertex with degree £ — j at time s has degree £ at time r — 1 satisfies the 
differential equation (I5.3j) . as well as the boundary condition fj{r — 1) = 5jQ. 

It seems intuitively plausible that the difference ej(s) = fj(s) — a,j(s) 
between the continuous and the discrete "solutions" will always be small. 
Indeed we shall prove the following lemma, which is very crude in most 
ranges. 

Lemma 5.1. For all £ > 8 and < j < £ — 1, we have: 



\ ej (s)\ < < 



(2?00l ( r _l)/^2 <s < r _ 1 orj < 

m^Il ( T -l)/£ 3 <s<{t-1)/£ 2 



T-l 

1 r < s < (r- l)/£ 3 . 



We shall defer the proof of Lemma 15.11 to the next section. 
We set 

T-l T-l t 

^ T _ 1 (y) = 4^/,_ 1 ( S ) 2 + 4^^P s ( J D s , J D s + y fe )(/ £ _ fc ( S )-/,_ fe _ 1 ( S )) 2 . 

S=TQ S = TO k=l 

We now show that the bound in Lemma 15.11 suffices to show that 
is not much larger than (V). 



Lemma 5.2. For any £ and t, with 8 < £ < 2ip\/r — 1, 

< K-i(Y) + 5 x 10V + 20^^. 
Proof. Equation (|5.2p tells us that & T _ 1 (Y) is at most 

T-l T-l t 

2 Yl °^-i( s ) 2 + 2 S Yl Pg ( Ds > Ds + */k)(a*-k(«) - a^-fc-i(s)) 2 - 

S=T0 S=T0 fe=l 

Using the inequalities aj(s) 2 < 2fj(s) 2 + 2ej(s) 2 and 

(o^s) - ai-i(s)) 2 < 2(/ i ( S ) - fj-^s)) 2 + Ae j {s) 2 + 4e j . 1 (s) 2 , 
we deduce that 

T-l 



& T _ X {Y) < ^ T _ 1 (y) + 4^ e ,„ 1 ( s ) 2 

S = T0 

T-l £ 



S=T k=l 
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Now we apply the bounds from Lemma l5.lt 

r _l . n / „ ,„ \ 2 



/2200A 2 r-l/ 800£ 3 / 2 \ r-1 



5 x 10 6 £ 2 10 6 ^ r - 1 

- r-1 + r~^l +_ £ 3_ 

6 x 10 6 ^ 2 t - 1 

- r-1 + ^^' 

and similarly 

r-1 f 

E E P S (D S , D s + y k ) (ee- k (s) 2 + e £ _ fc _i(s) 2 ) 



+ 



P 



S = T k = l 



/2200A 2 r - 1 / 800£ 3 / 2 V r-1 



< 2 r - 1 + — — + 



\T-lJ P \ T - 1 J P 



, 6 x 10 6 ^ 2 r - 1 
£ 2 — + 



r-1 P 
Therefore 



120 x 10 6 ^ 2 r-1 



< %_ X {Y) + + 20 



r-1 P 

< *^_ x (y) + 5 x loV + 20^_ll 

p 

as claimed. □ 

For k = 1, . . . ,£, we have that P S (D S ,D S + = kD s {k) /2{s — 1), since 
each of the D s (k) vertices of degree k has probability k/2{s — 1) of receiving 
an extra edge at time s + 1. Therefore 

S=TQ S = T() k=l 

The double sum is the main term here, and we mainly concentrate on this; we 
will obtain adequate bounds on /^_i(s) 2 as a byproduct of our estimates. 



Recall our assumptions (|5.ip that D s (k) = for all k > ipy/s — 1, and 
that D s (k) < 5^ + 400?/; 2 log(^s) for all k = 1, . . . ,£ with k < Vv 7 ^ 7 !. 
Using these bounds, we find that, for all k = 1, . . . ,^o> an d all s > 4, 

7 , 9 , . , , k 

^ T2 + 550 ^ lQ g(^ S )-- 

5 1 K S 
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Thus we have 

T-l 

K-iOO < fi-ii*) 2 

S=TQ 
T-l t 

S=TQ k = l 

7-1 t , 

+ 1100V 2 log(Vr) ^ -(h-k(s) - fi-k-i(s)) 2 . 

s=to k=l 

We define 

T-l I 
S=T0 k = l 

r_1 1 h 

S=TQ k = l 

so that 

T-l 

* T -i(Y) < 4 h-M 2 + 14Qi(t,€) + 1100V 2 log(^r)Q 2 (r,*). (5.5) 

S=T 

To estimate Qi, we exchange the order of summation and substitute j = 
e-k: 

Qi(t, I) = £ E (/,(-) " /,-i( S )) 2 • 

j=0 ^ J ' S=T 

From (|5.3p and (|5.4p . we have 

f 3 (s) - f 3 ^{s) = " -L(l - ^-V-J (£(1 - w) - i) , 

where u = v s = \/s/(t — 1), as before. We estimate the sum over s by 
approximating it by the integral 

The integrand here is bounded above by 1, since each fj(s) is at most 1. 
The function (1 — , y)- ? ~ 1 ir (£(1 — v) — j) has derivative which is a positive 
multiple of a quadratic function of v, so the function has just two stationary 
points, one either side of the zero v = (£ — j)/£. Therefore the integrand, 
which is a positive multiple of the square of this function, has two local 
maxima. The sum is then at most the value of the integral plus the values 
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of the integrand at the two local maxima, and so 

In the last line, we changed variable: recall that s = v 2 (t — 1). 
We write 

0i(r,0<4 + |:|^(': 1 )Vj ) l) ) 

where 

I(£,j,a) = f 1 (l-v) 2 ^ 2 v 2 ^ +a (l(l-v)-j) 2 dv, 

Jv=0 

for positive integers I and j, and integer a, where i > j and a > —1. 

The integral above can be evaluated as a sum of Beta functions. We 
will be confronted by a very similar integral when estimating Q2, and it is 
convenient to prove a lemma covering both cases (here we need a = 1 and 
later we shall take a = — 1). 

Lemma 5.3. For integers i and j with i > j > 0, and integer a > —1, 

^ h a) ~ i2i ~(2lta?iy~ 2)l ji m ~ 3 + 1} + a(3 + a)} {j ~ 1} 
and 

/(AO, a) < l -. 

Proof. For non-negative integers a and b, we have the identity 

f (l-v) a v b dv = B(a + l,b + l)= - 
Jv=o (« + 0+1)! 

where B(-, •) denotes the Beta function. 
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For j > 1, the required integral can be written as a sum of three integrals 
of the form above, and we obtain 



I(t,j, a) 



<a _ „■ + o)l / /» _M_ - ^ M + f W " 2 > ! 



(2^ + a + l)! J (2^ + a)! J (2^ + a-l)! 
(2J-2)! 



V J ; (2£ + a + 1)! 
x {£ 2 (2j - l)(2j) - 2j£(2j - 1)(2* + a + 1) +i 2 (2£ + a + 1)(2£ + a)} 
(2£-2j + q)!(2j-2)! 



(2^ + a + l)! 



li{2^-i + l) + a(i(l + a) + 2£)} 



< (M -!;| a) ^,- 2 V w-j + l) + a(3 + a )}. 
(2« + a + 1)! 

as claimed. 

For j = 0, we have 

l(£,0,a)=£ 2 v ^dv = —- <-, 

K ' J v=0 2£ + a + l ~ 2' 

for all a > —1, also as claimed. □ 
Lemma 15.31 with a = 1, tells us that 

Qi(r,i)<4 + 2^I(*,0,1) 



^ jl{2(l-j + l) + 4} ^-l\ 2 ( 2 ^-2j + l)!(2j-2)! 

+ 2(T_1) g 1 j ) (2T+2)! 



, . T-l > ^ m - 3 + 3) {£ - l)! 2 (21 - 2j + 1)! (2j - 2)! 

P [ (l-jf (2£ + 2)\ (£-j-l)! 2 j! 2 

«"1 /n/A -1 



<+^<->e( 2 ;) 

.7=1 v 7 



1 



£(2£ + 2)(2^ + l) 



2£ - 2 A (l-j + 3)(2l-2j + l) /2 A j 
^-jj (t-3) 2 \jj2j(2j-l)- 

This is the first of several occasions in the paper where we use the in- 
equalities 

2 2x < (2x\ < 2 2x 



2y/x V X J + l' 

the first is valid for all integers x > 1, and the second for all integers x > 0. 

/2aA 2 2a: 

Sometimes, as below, we use simply that I I < —=. 

\ x I Jx 
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We obtain 

r - 1 t-k 2\/i 1 2 2e ~ 2 i 2 2j 1 

g l(T , f) < 4+ y +4(T _ 1) ^^_Ll_ 12 y 



r-1 12(r-l) ^ 1 

f3 ^3 „-3/2 1 



£3 ^3 .-3/2 W £ _ j ■ 

j=l J V J 

To estimate the sum appearing above, we use the numerical value YlJLi J~ 3 ^ 2 — 
2.61238, and the crude bound 



e/2 



and obtain 

g,(r.fl< 4 + fcil. (M) 



The next step is to estimate 



s 

S=TQ fc=l 



j=0 S = T() 

As before, we shall start by fixing j, and estimating the sum over s by the 
integral 

-T-1 



r ^(/,( S )-/ J -i( S )) 2 ^ 



-1\ 2 1 fi (i_ ^-2^-^(1 _„)_j)2 



^-1\ 2 1 



r 7(€,j,-l). 



We used the expression for fj(s) — derived earlier, and made the 

substitution s = v 2 {t — 1). 

To bound the difference between the sum X^I=t 1 (fj( s ) ~ /j-i( s )) 2 an< ^ 
the corresponding integral is not completely straightforward. The integrand 
s (fj( s ) ~ fj~i( s )) 2 can De written as 

Hj(v) = where = ^ ~ *Vl - v) j ~ V^" 1 ^ - v) -j). 

The function has stationary points at 



^ V £- 1 
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Therefore Hj(v) has a global minimum at v = {I — j)/£, and local maxima 
at the two points v*, and so the global maximum of Hj(v) is attained at one 
of the u*. For j > 1, we can write 



\hj(v*)\ 



< 



£-2 

j(£-j) LVi-i 

£-1 



(1 



j(£-j) 
£-1 



j(£-jY 



where we used the fact that the term in square brackets is the probability 
that a Binomial random variable with parameters {£ — 2, v*) takes the value 
£ — j — 1, and is therefore at most 1. Hence we have 



Hj{v) < 



£-1 1 

j(£-j)r-r 



for all j > 1 and all v. For j = 0, the maximum value of hj(v) is 1, and thus 
Hq(v) is at most for all v. 
We have that 



Q 2 (r,£) < 

j=0 

e-i 



1 



1 



3=0 

e-i 



j J (£-3? 
1 

3 J ~3 



I(£,j,-1) +2maxHj(v) 



3=1 



3=0 



£-\Y 1 



i / £-3 



i(£J,-i) + 



2£ 2 
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Now we use the bounds for I(£,j,—1) from Lemma I5\3l We also use that 
I < 2ip\/r — 1, and ip > 3, to obtain: 

2f 2 1 P 

Q2{T , e) < _1 T + ^ 



,2 1^1 



/.2 , k ^ n„/.2 



< 8^ +5 < 9-0 . (5.7) 



The next task is to bound the sum 



T-l T-l 

£/<-iW 3 = £(i 

s=r s=ro 



W ) 2£ -V, 



where, as before, t> = y/s/r — 1. This sum is bounded above by the integral 
Jg = Q(l—v) 2 ^~ 2 v 2 dv, plus the maximum value of the integrand. The integral 
is equal to 

which is more than small enough for our purposes, and the integrand is 
certainly at most 1, so 

J>-i(*) 2 <l + ^. (5.8) 

S=TQ 
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Finally, we combine all our estimates. For any t = 1, . . . ,£q we have, by 
LemmaEa (pT5]l . (pT5|) . (pT7j) and (jO]h that 

r - 1 



$r-i( y ) < ^t-i( y ) + 5 x 10fy 2 + 20 



^ 3 

r-l 

< 4 ^ /^(s) 2 + 14Qi(t,£) + 1100^ 2 log(^T)Q 2 (r,^) 

S = T0 

+ 5 x 10V 2 + 20^-^ 

< 4 + ~ + 56 + 1400^3^ + 9900V; 4 log(^r) 
+ 5 x 10V 2 + 20^-^ 

< 1600^-^ + 10V log(^r). 

£ 6 

Thus <I>^_ 1 (y) < Br, as required. This completes the proof of Lemma B~2l 
except for the proof of Lemma 15. 11 to which the next section is devoted. 

6. Proof of Lemma I5TTI 

Our aim in this section is to prove the following upper bound on e,-(s) = 
fj(s) — aj(s), to be valid whenever £ > 8, < j < £ — 1 and tq < s < r — 1: 

( 220M ( T -l)/£ 2 <s<t -lor j < £-2 



|ej(s)| < < 



%^ (r - l)/^ 3 < s < (r - l)/£ 2 
1 t < s < (r - l)/£ 3 . 



The final case is straightforward, since both aj(s) and lie between 

and 1 for all < j < £ — 1 and tq < s < t — 1. So from now on we assume 
that, if j = £ - 1, then s > (r - l)/£ 3 . 

Recall that aj(s) and /j(s) satisfy the recurrences: 

for all j > 1 and all s with tq < s < r. 

The term in square brackets is, by Taylor's Theorem, equal to ^f"(w) for 
some w 6 (s — 1, s). We will thus estimate it by bounding the absolute value 
of the second derivative of fj . 

Lemma 6.1. For all 1 < j < £ — 2 and tq < s < r — \, 

140 (l-l) 5 / 2 
'^^'^ .(r-l)^-^/ 2 - ^ 
This bound also holds if j = £ — 1 and s > (r — l)/^ 2 . 
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3 ^ „ ^ t„ l\//)2 



For (r - I) /I 6 < s < (r - I) /I 1 , we have 

\f'U(s)\<l^/s{T-l). 

Proof. Using the expression for f'As) in (|5.4h . as well as the identities s 
v 2 (t — 1) and (£ — — v) — jv = £{1 — v) — j, we can write 



and then we have: 



1\ 1 



j /2(r-l) 



(l-v) j ~ 1 v e - j ' 2 (£(l-v)-j), 



f'(s) 



1 \ 1 



.(i_ w )i-V^'- 3 



j ;2(r-l)2s 
x [(£(1 - u) - j){{£ - j - 2)(1 - v) - (j - l)v} - iv(l - v)] 



j /4s(r-l) 



(i - v y- 2 v 



{(e-i)(i-v)-j} 2 -(i-v) 2 -vj 

Let us first verify the result for j = 1, when we can write 

4 S (r - l)f» (s) = (£- l)v e - 3 [(£ - 1)(£ - 3) - v£(£ - 2)]. 

The right-hand side is increasing from v = to v = (£—l)(£ — 3) 2 /£(£ — 2) 2 , 
and decreasing thereafter. It is thus always at least its value at v = 1, which 
is —(£ — 1)(2^ — 3), and at most its value at the stationary point, which is 
at most 



l)l E ~ 3 (£-l)(£-3) I 1 



and thus 



£-3 



1 



l) 2 (l~3) 
(£-2) 



<(i-l) 2 , 



2s(r - 1 



-(£-l) 2 , 



which is as required. 

We now embark on the calculation for 2 < j < £ — 2. We define a 
parameter ip = ip(v) by v = {I — j — 2 — tp)/(£ — 4), so 1 — t; = (j — 2 + 
tp)/{£ — 4), and — (j — 2) < <p < £ — j — 2. The point is that the "main term" 
(l — vy^v^i' 2 in our expression for the second derivative of / is maximised 



at (p = 0, whereas the other term 
small for small <p. We write 



is 



4s(r - = fcjfcafca, 
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where 



fci = 



£-l\ fj-2\ j ~ 2 f£-j-2^ £ - j - 2 



j J \£-Aj v 



\ j-2 / \ e-j-2 



£-4 V V £-4 / J £-4 



If j = 2 or j = £ — 2, the terms in fci and &2 with a power of j — 2 or 
£ — j — 2 respectively are treated as equal to 1, and therefore absent from 
the products. We shall estimate fci, ki and ks separately to start with, and 
then consider A^A^. 
We have 



fci 



£-l\ fj-2\ j - 2 f£-j-2^ £ - j - 2 



j J \£-4j V 
(£-!)(£- 2)(£ - 3) f(£-4)\e^ 4 \ ( {j - 2)^ 2 



j(j-l)(£-j-l) \ {l-±y-* J \(j-2)\ei-i 
{£-j-2f-^ 
(£- j -2)le*-j- 2 



Again, if j — 2 or £ — j — 2 is zero, the related term is absent (i.e., the ratio 
is equal to 1). We now use the inequalities 



Vx+ 1 (^y <x\< z^/x (^y , 



to obtain that 



(l-l)(l-2)(l-3) / £-4, 

ju-w-j-i) Vu-w-j-iy 



(Note that this remains valid if j = 2 or j = £ — 2.) 

We next consider hi- We assume for the moment that j < £/2 (the other 
case is symmetric) and distinguish two ranges. First, we consider the case 

2 

where \(p\ < j — 2. In this case, we use the bound log(l + x) < x — valid 
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for all \x\ < 1, and obtain: 



log/c 2 = (j-2)log 1 + 



<P 



3-2 



+ (£-i-2)log 1 



e-j-2 



< (i-2) 



+ (^-i-2) 

.2 / 1 



j - 2 4(j - 2)2 



+ 



e-j-2 4(i-j-2y 
i 



4 Vi-2 e-j-2 
2 e-4 



r 



4 (j-2){l-j-2Y 



In the case where ip = a(j — 2) with a > 1 and j > 2, we estimate 



1 + 



J-2 



i-2 



9? 



e-j-2 



e-j-2 



If j = 2, then k 2 is just (1 - ip/(l - 4))^" 4 , which is at most < (2/e)v. 
There is a final case where <£> = — {j — 2), i.e., i> = 1, and we may dispose 
of this immediately since the second derivative is zero unless j = 2. In 



e-4 



summary, fc 2 < exp v 4 ( J _ 2)(£ _ J _ 2 ) 
and fc 2 < (2/e)M otherwise. 

Let us organise ks as a quadratic in ip: 



if M <min(j-2,£-i-2), 



<^ 2 ^ - 2) - v?(4^ 2 - 7<7 - 8^ + 12j) 



- 4) 2 

+ ((2£ - 3j - 2) 2 - (j - 2) 2 - j(£ — A){e — j — 2)) 



Using the inequality 



\ap 2 + bp + c\ <2ap 2 + b 2 /4a + \c\, (6.2) 
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valid for all positive a, and all real b, c, ip, we have, for all £ > 8, 

21(1 - 2) 2 (4l 2 -7lj-8l + 12j) 2 
3 " 4^-2)(£-4) 2 

|(2l-3j-2) 2 -(j-2) 2 -j(l-4)(l-j-2)| 



+ 



2 (2(l-4)(2l-3)-(j-2)(7l-12)) 2 
- ^ 4^-2)(^-4) 2 

,. , ' |2^-3j-2| \ 2 + A-2\ 2 + j(£ - j - 2) 



£-4 J \£-4 £-4 



< 6ip 2 + 4 + 4 + 1 + ^ 7 ' ? ' 



4 



Now we combine our bounds to produce a single bound on ^2^3- The 
j(£ — j — 2) j(£ — j — 2) 
product of &2 with 9 H — is certainly at most 9 H — < 

j(£ —7 — 1) 

10 — , while the product of k,2 with 6(p 2 is at most the maximum 



of 6tp 2 I - I < 35 < 35 ^^ J - and 

2 ( \^ 24 (j-2)(l-j-2) j(£-j-l) 
* 6XP l-T (;-2)«-;/-2) J * T ^4 < 35 ^3^ 

We can summarise by saying that, provided £ > 8, for all values of ip, 

k2k3 < 45 ^1> 

Therefore, overall, we have, for 2 < j < £ — 2, 



n M /o C /o.^-i-!) 



nif'Mi < 10t - (^-i)(^-2)(€-3) 1-4 j(i-j-i) 

(£-l)(£-2){£-3) 1 1 



= 135- 
< 560- 



(j - 1) 3 / 2 V^-i-i 



provided ^ > 8. 
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Consider now the special case j = I — 1, when the bound (16. ip translates 
to 4s(r - l)|/"(s)| < 560(^ - 1). Using (|6^ . we have 

4^-1)1/^(5)1 = ^—^tl\{£-l)W-{£-l) v -{l- v f\ 

< {l ~ vY " fa-i)V + ^ 

< 2(^-l)Vl-^- 3 + 2(1 ~^' 3 . 
The first term above is maximised at v = l/(£ — 2), so we have 

2(/ - l) a «(l - ^ 3 < 2 { ^f (l - < 2(t - 1). 

If s > (r — l)/^ 2 , then v > l/£, and the second term above is at most 
2£(l-v) £ - 3 < 3(£-l), so we do have 4s(r - l)|/j'(s)| < 5(^-1), as desired. 

For the range (r — l)/£ 3 < s < (r — l)/£ 2 , we have v > £~ 3 / 2 . This gives 
4s(r - l)|//_ x (s)| < 2(* - 1) + 2£ 3 / 2 , and thence |/iL x (a)| < £ 3/2 /s{t - 1), 
as claimed. 

This completes the proof. □ 

We are now ready to bound the difference Cj(s) = dj(s) — fj(s). Recall 
that we have, from comparing the recurrences satisfied by the two systems: 

e 3 (s - 1) = (l - ^) e 3 {s) + ^ViOO " [Ms - 1) - f 3 (s) + f^s) 
For < j < £ — 1 and To < s < r — 1, set 

^ / /> \ 3/2 

We now use induction on j and r — s to show that |ej(s)| < Cj—^j, for all 
< j < ^ — 2 and To < s < r — 1. (We shall return to the case j = £ — 1 
afterwards.) 

£ 

We first check the inequality |eo(s)| < for j = 0. We have that 

r — 1 

ao(r — 1) = 1, and, for to < s < t — 1, 

a (s - 1) = M - — j a (s), 



so 



while 



II (i-s;)- 
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Thus we have 

Ms) _ W (i - y w y' 2 

aoW w i} +1 (l-£/2w) ■ 

Each term in the product is clearly at least 1, so fo(s) > ao( s ) f° r an s - If 
s < £, then we certainly have |eo(s)| < /o( s ) < s /( r ~~ 1) < ^/( T — 1)j so we 
may assume that s > Now we have, for all w > s > 2£, 

(i - i/wY' 2 i - ^ + A 1 ^ 2 ^ 2 / £ 2 

< 2w 8w < 1 + 5 = 1 + — ? < exp ' 



1 - £/2tt; " 1 - * ~ 1 - 1/2 8w 2 ' 4w 2 ~ 1 \4w 2 



2w 

This means that 

r-l 



w=s+l 

Now we write 

|eo(s)| = fo(s)(l-a (s)/f (s)) 

1/2 

< 



\ / 

-J (l-exp(-£ 2 (r-l-s)/4s(r-l)) 



< 



r 



.t-1/ 4s(r-l) 

This function is maximised at s = (1 — 2/£){t — 1), and its value there is 
equal to 

(1 - 2/l)^/ 2 ~ 1 l 2 (2/l) £ 

4(r - 1) ~ 2(r- 1) ' 

as required for the case j = 0. 

For j > 0, we have e^r - 1) = Oj(t - 1) - /, (r - 1) = - = 0. Now, for 
the induction step, suppose that < j < £ — 2, tq < s < r — 1, and that we 
have verified our inequality for both |ej_i(s)| and |ej(s)|. Hence we have 

M*- 1)| 



< 



+ |/ J .( s -l)-/ j ( a ) + /'( s )|. 



We now note that fj(s — 1) — fj(s) + fj(s) = f"(w)/2 for some w in 

70 £ 5 / 2 

(s — l,s), and so its absolute value is at most — ; — „,„.„ -77^, by 

s(t — 1) y I {£ — 3)1 
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Lemma 16. 11 Therefore 

i \(^ l-i\r, Z~j r 70 I 3 / 2 
t-1 \ 2s 2s Gj - 1+ ap/*{£-jy/ 

f/2 V 



| ej -(a-l)| < 



T-1 

I 



c J + -or\ ~ C i + + 14 °" 



2s j 3 / 2 (i-j) 3 / 2 



where the last line is by the definition of the Cj's. 

For j = 1 — 1 and s — X > (t — X)/£ 2 , the same calculation still gives us that 
\ e e-i($ — 1)1 < C(-\1/{t — 1), since the bound (|6.ip is valid for as 
long as w > (t-1)/£ 2 . For values of s— 1 with (r-1)/^ 3 < s-1 < (r-1)/^ 2 , 

we replace the bound on the second derivative by \f"(w)\ < — -y, and 

the same calculation gives 

I £ 3 / 2 
\ee-i(s)\ <C/_ 2 t + 



T-1 T-1 

^ £ 3 / 2 

We now observe that the sum y, ^W72 7 vU2 ls um f° rrm y bounded, 

i=l 1 

being at most 2 > -— y— < 15. Therefore we have 

, ,. 2200^ 

for all < j < £ — 2 and tq < s < r — 1, and also for j = £ — 1 and 

(T - l)/^ 2 < S < T - 1. 

For (r - l)/^ 3 < s < (r - l)/£ 2 , we have 

, sl 2200^ ^ 3 / 2 800£ 3 / 2 
\et-i(s)\ < + < -, 

T — 1 T — 1 T — 1 

since £ > 8.0000 

This completes the proof of Lemma 15.14 and hence in turn the proofs of 
Lemma 14.21 and Theorem 11.11 



7. Concentration for U t (£) 

In this section, we give a very brief sketch of the proof of Theorem 11.21 
The proof proceeds on very similar lines to that of Theorem 11.11 

Recall that Ut{£) is the number of vertices of degree at least £ at time t. It 
is easy to show that the expected value of U t {£) is close to u t (£) = 2t/£(£+X), 
uniformly over t and £. 
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The difference Ft(£) = Ut(£) — Ut(£) satisfies the matrix equation 

F t = Wt-iFt-i + AMj. 
Here AMt is a vector of martingale differences, and 



/I 



_2_ 

2u 



J_ 

2u 







1 



2u 



o \ 













2m x 



2u 



7 



The indexing of vectors and matrices runs from £ = 2 to £ = £q\ we do not 
need to track the component £ = 1 since Ut(l) = t for all t. As for Dt, the 
plan is to apply Theorem 12.51 to the martingale 



M, 



t 

C s _iAM s 

s=r + l 



where C s = YluZ s+ i W u , for fixed r > r . 

A transition of £7 S involves an increase of 1 in at most one component 
U(£); in other words U s +i is obtained from U s by adding some unit vector 
e£, or leaving the vector unchanged. 

For £ = 2, . . . , £q, and tq < t < r, we set 



t-i 



s=tq k=2 



We set, for each 1 < £ < £q, 



S e = Tlh—^- + 10 1 V iog 13 0H 

and 

T| = inf{t > t : > 
An application of Theorem 12.51 gives 



T <t<T 



sup \M{(£)\ > 3y/\ogtyT)S e ) A (T| > T) \ < 

for each t > tq and 2 < £ < £$. 

Now let T be the infimum of the times s > tq such that either there is a 
vertex of degree at least ipy/ s — 1, or, for some k < £q, 



U a (k) 



2s 



k(k + l) 



> A5 V s ^js) + 4 x lo9v , log7(V)s) 



The next step is to prove the following result, analagous to Lemma W. 
Lemma 7.1. For all t < f, and all £ = 2,...,£ , <§>\_ X {Z) < S l . 
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The result implies that T| > T for all I = 2, . . . , £ . 
Now, for each r > tq and each £ > 2, set 

S r {£) = 45 V^Og^T) + g x w 9^ log 7 {lfjT ^ 

We have 

In particular, together with Lemma |7. II and the fact that MJT = _F T for each 
T > r 0i this implies that 

v((\F t (£)\>6 t (1))A(T>t))<^. 

We next use this inequality to show that, with probability at least 3/ip, for 
all r > To and I = 2, . . . , £q, either |i<V(£)| < 5 T (£) or r > T. Similarly to the 
proof of Theorem 11.11 this leads to the conclusion that P(T < oo) < 4/tp, 
which is the desired result. 

8. More complex preferential attachment models 

In this section, we discuss some of the issues we confront when extending 
this proof to other models of preferential attachment. 

A first extension would cover the model which again generates a random 
tree, where now an arriving vertex chooses an existing vertex v as a neigh- 
bour with probability proportional to X(v) + (3, where X(v) is the degree of 
vertex v, and /3 is a fixed constant. For such a model, the expected degree of 
a vertex at time t grows as Ct 1 ^ 2+l3 \ and the expected number of vertices 
of degree £ at time t behaves as Ct/£^ + ^ . When attempting to follow the 
proof in this paper to establish concentration results, the main difficulty is 
in finding a suitable analogue of Lemma 15-H giving bounds on the error 
function playing the role of Cj(s). 

Another well-studied variant is to have each arriving vertex select some 
fixed number m of neighbours (with replacement), instead of just one. The 
main difficulty introduced by this variation is that we have to account for 
the possibility that some existing vertex has its degree increased by more 
than one at each step, and that the recurrence relations do not have such 
clean forms. 

In the full Cooper- Frieze model (see [6j, [5]), the number of new edges 
added at each step is a random variable. Indeed, with some probability, no 
new vertex is added, and some edges are added between existing vertices, 
chosen either uniformly or via preferential attachment. This means that the 
numbers of vertices and edges present at time t are no longer determined, 
causing further complications in the application of our method. 

We do believe that all of these problems can be overcome, and that our 
method can be used to analyse general Cooper-Frieze models. We also hope 
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that the method will find further applications in the analysis of other random 
processes. 

References 

[1] Athreya, K.B., Ghosh, A. P. and Sethuraman, S. (2008). Growth of preferential at- 
tachment random graphs via continuous-time branching processes. Proc. Indian Acad. 
Sci. (Math. Set.) 118 473-494. 

[2] Barabasi, A.-L. and Albert, R. (1999). Emergence of scaling in random networks. 
Science 509-512. 

[3] Bollobas, B. and Riordan, O. (2002). Mathematical results on scale-free graphs. Hand- 
book of Graphs and Networks (S. Bornholdt and H. Schuster, eds.), Wiley- VCH, 
Berlin. 

[4] Bollobas, B., Riordan, O., Spencer, J. and Tusnady, G. (2001). The degree sequence 
of a scale-free random graph process. Rand. Struct. Alg. 18 279-290. 

[5] Cooper, C. (2006). Distribution of vertex degree in web-graphs. Combin. Probab. 
Cornput. 15 637-661. 

[6] Cooper, C. and Frieze, A. (2003). A general model of web graphs. Rand. Struct. Alg. 
22 311-335. 

[7] Darling, R.W.R and Norris, J.R. (2008). Differential equation approximations for 

Markov chains. Probability Surveys 5 37-79. 
[8] Deijfen, M., van den Eskcr, H., van der Hofstad, R. and Hooghiemstra, G. (2007). A 

preferential attachment model with random initial degrees. Arkiv for Matematik 47 

41-72. 

[9] Dereich, S. and Morters, P. (2009). Random networks with sublinear preferential 
attachment: Degree evolutions. Elec. J. Probab. 14 1222-1267. 
[10] Ethier, S.N. and Kurtz, T. (1986). Markov processes: characterisation and conver- 
gence. Wiley. 

[11] Janson, S. (2005). Asymptotic degree distribution in random recursive trees. Rand. 
Struct. Alg. 26 69-83. 

[12] Lu, J. and Feng, Q. (1998). Strong consistency of the number of vertices of given 
degrees in nonuniform random recursive trees. Yokohama Math. J. 45 61-69. 

[13] Luczak, M.J. (2003) . A quantitative law of large numbers via exponential martingales. 
In Stochastic Inequalities and Their Applications. Progress in Probability 56 93-111, 
Birkhauser. 

[14] Luczak, M.J. and McDiarmid, C. (2006). On the maximum queue length in the 
supermarket model. Ann. Probab. 34 493-527. 

[15] Luczak, M.J. and Norris, J.R. (2005). Strong approximation for the supermarket 
model. Ann. Appl. Probab. 15 2038-2061. 

[16] Mahmood, H.M. and Smythe, R.T. (1995). A survey of recursive trees. Theory Prob- 
ability Math. Statist. 51 1-27. 

[17] Mahmood, H.M., Smythe, R.T. and Szymariski, J. (1993). On the structure of random 
plane-oriented recursive trees and their branches, Rand. Struct. Alg. 4 151-176. 

[18] McDiarmid, C. (1998). Concentration. In Probabilistic Methods for Algorithmic Dis- 
crete Mathematics. (M. Habib, C. McDiarmid, J. Ramirez and B. Reed, eds.) 195 - 
248. Springer - Verlag, Berlin. 

[19] Mori, T.F. (2005). The maximum degree of the Barabasi- Albert random tree. Combin. 
Probab. Cornput 14 339-348. 

[20] Norris, J.R. (1997). Markov chains. Cambridge University Press. 

[21] Rudas, A., Toth, B. and Valko, B. (2007). Random trees and general branching 
processes. Rand. Struct. Alg. 31 186-202. 

[22] Szymariski, J. (1987). On a nonuniform random recursive tree. Annals Discrete Math. 
33 297-306. 



52 



GRAHAM BRIGHTWELL AND MALWINA LUCZAK 



[23] Szymariski, J. (2005). Concentration of vertex degrees in a scale- free random graph 

process. Rand. Struct. Alg. 26 224-236. 
[24] Talagrand, M. (1995). Concentration of measure and isoperimetric inequalities in 

product spaces. Publ. Math. Institut des Hautes Etudes Scientifiques 81 73-205. 
[25] Yule, G. (1925). A mathematical theory of evolution based on the conclusions of Dr. 

J. C. Willis Phil. Trans. Roy. Soc. London B213 21-87. 

Department of Mathematics, London School of Economics, Houghton Street, 
London WC2A 2AE, United Kingdom 

E-mail address: g.r.brightwell@lse.ac.uk 

URL: http : //www . maths . lse . ac .uk/Personal/graham/ 

School of Mathematics and Statistics, University of Sheffield, Hicks Build- 
ing, HOUNSFIELD RD, SHEFFIELD S3 7RH, UNITED KINGDOM 
E-mail address: m.luczak@sheffield.ac.uk 



